CDS

Accession Number TCMCG058C28129
gbkey CDS
Protein Id KAF7149966.1
Location join(554188..554445,556250..556623,556983..557050,557172..557239,558234..558341,560633..560881,562161..562534,563119..563311)
Organism Rhododendron simsii
locus_tag RHSIM_Rhsim02G0005600

Protein

Length 563aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA588298, BioSample:SAMN13241185
db_source WJXA01000002.1
Definition hypothetical protein RHSIM_Rhsim02G0005600 [Rhododendron simsii]
Locus_tag RHSIM_Rhsim02G0005600

EGGNOG-MAPPER Annotation

COG_category S
Description 4,5-DOPA dioxygenase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R08836        [VIEW IN KEGG]
KEGG_rclass RC00387        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K15777        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00965        [VIEW IN KEGG]
map00965        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCGCAGGCGAAGGTGAAGGAGACATACTACATATCACATGGGTCACCCACGCTTTCCATCGACGATTCGTTGCCGGCCAGGGGTTTCTTGAAATCGTTTAGGGACACCGTCTACGGCGGCGCACAGAGACCACCGCCTACCTCAATCCTCGTCATCTCCGGCCACTGGGAGACCGATTACCCCACCGTCAACGCCGTCTCCGGCACCTGCGACACCATCTATGATTTCTACGGCTTCCCCCAAGAGATGTATAAGCTCAAGTATCCGGCACCGGGAGCTCCACAATTAGCGACAAGGGTGAAAGAATTACTAACGGCATCTGGGTTCCAACGGGTGAGTGTGGATAAAAAGCGTGGGCTAGACCATGGAGCTTGGGTGCCACTCATGCTCATGTTCCCAGAGGCTGATATACCAGTGTGCCAGCTCTCAATCCAGACCAATAGGGATGGAACTTATCATTATAATCTGGGGAAAGCATTGGCTCCTCTCAAGGAGGAAGGTGTCCTTATTATTGGGTCTGGTTCTGCCACTCACAACCTGAGGGCCCTACGACAGTCCAGAGATGGCTCTGTTGCTTCTTGGGCTTTGGAGTTCGATACGTGGCTCAAAGACGCCCTTCTCAATGGAAGGAGTGAAGGGGACTTGTCCACTGCCTTTAATAGTAACACCAGAGCAAATTTGATAAAGTACATTAGAGTTGGTTATGAAACCCATGGTATCATGCGTAGATGGGAGATGTTGAATAAATCACTGAAACAGATACAGACAGTAGAGGCTACTTTGGTTTTGTTGCAAGCGAAGGCAGTGGAGGAGAGAAAAGTGCAAGTCAAAAGTGGAAAAGGGGAGAGAGAACGCTTGTTTATCCTCGGCAAGGCGAAGGTGAAGGAGACATACTACATATCACATGGGTCGCCAACGCTTTCCATCGACGATTCGTTGCCGGCCAGAGGTTTCTTGAAATCGTTCAGGGACACCGTATACGGCGGCGTACAGAGACCACCGCCCACCTCCATCCTCATCATCTCCGGCCACTGGGAGACCAATTACCCCGCCGTCAATGCTATCTCCGGCACCTGCGATACCATCTATGATTTCTACAACTTCCCCCAAGAGATGTATAAGCTCAAGTATCCAGCACCGGGAGCTCCAGAATTAGCGACAAGGGTGAAAGAATTACTAATGGCATCAGGGTACCAAAAGGTGAGCGTGGATAAAAAGCGTGGGCTAGACCATGGAGCTTGGGTGCCACTCATGCTCATGTTCCCAGAAGCTGATATCCCAGTGTGCCAGCTCTCCGTCCAGACTAATAGGGATGGAACTTACCATTATAATCTTGGAAAGGCATTGGCTCCTCTCAAGGAGGAAGGTGTCCTCATTATTGGTTCTGGTTCCGCCGTTCACAACTTGAGGGCCCTAAGCCTGTCCGGGGATGGTTCCGTTGCTGCTTGGGCTTTGGAGTTCGATACATGGCTCAAAGATGCCCTTCTCGATGGAAGGTATGAAGATGTCAACCACTACGAAGAGAGAGCACCGCATGCAAAAGCGGCACACCCGAGGCCAGACCACTTCTATCCACTGCATGTAGCCATTGGTGCCGCGGGTGAAAATGCAAAAGCTGAACTAATCCACAATAGCTGGCAGCTTGGCACGCTTTCCTATGCCTCCTACAAGTTCACACCAACTGAATGA
Protein:  
MAQAKVKETYYISHGSPTLSIDDSLPARGFLKSFRDTVYGGAQRPPPTSILVISGHWETDYPTVNAVSGTCDTIYDFYGFPQEMYKLKYPAPGAPQLATRVKELLTASGFQRVSVDKKRGLDHGAWVPLMLMFPEADIPVCQLSIQTNRDGTYHYNLGKALAPLKEEGVLIIGSGSATHNLRALRQSRDGSVASWALEFDTWLKDALLNGRSEGDLSTAFNSNTRANLIKYIRVGYETHGIMRRWEMLNKSLKQIQTVEATLVLLQAKAVEERKVQVKSGKGERERLFILGKAKVKETYYISHGSPTLSIDDSLPARGFLKSFRDTVYGGVQRPPPTSILIISGHWETNYPAVNAISGTCDTIYDFYNFPQEMYKLKYPAPGAPELATRVKELLMASGYQKVSVDKKRGLDHGAWVPLMLMFPEADIPVCQLSVQTNRDGTYHYNLGKALAPLKEEGVLIIGSGSAVHNLRALSLSGDGSVAAWALEFDTWLKDALLDGRYEDVNHYEERAPHAKAAHPRPDHFYPLHVAIGAAGENAKAELIHNSWQLGTLSYASYKFTPTE